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We have purified a novel DNA polymerase from Thermus thermophilus. 
This was enabled by use of general gap filling assays to monitor poly- 
merase activity and cross-reactive monoclonal antibodies against the a 
catalytic subunit of E. coli DNA polymerase HI holoenzyme to distinguish 
a novel polymerase from the well characterized DNA polymerase I-like 
Thermus thermophilus DNA polymerase. Two proteins migrating with the 
polymerase after three chromatographic steps were isolated and sub- 
jected to partial amino acid sequencing. The amino termini of both were 
homologous to the two products of the E. coli dnaX gene, the y and x sub- 
units of the DNA polymerase III holoenzyme. Using this information and 
sequences conserved among dnaX-hke genes, we isolated a gene fragment 
by PGR and used it as a probe to isolate the full length Thermus thermo- 
philus dnaX gene. The deduced amino acid sequence is highly homolo- 
gous to the DnaX proteins of other bacteria. Examination of the sequence 
permitted identification of a frameshift site similar to the one used in 
E. coli to direct the synthesis of the shorter y DnaX-gene product. Based 
on this information, we conclude that a conventional replicase exists in 
extreme thermophilic eubacteria. The general biological and practical 
technological implications of this finding are discussed. 
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Introduction 

Escherichia coli, an often studied prototype for 
general eubacterial mechanisms, encodes three dis- 
tinct DNA polymerases. DNA polymerase I is the 
most abundant polymerase and is responsible for 
some types of DNA repair, including a repair-like 
reaction that permits the joining of Okazaki frag- 
ments during DNA replication. DNA polymerase 
IE comprises the catalytic core of the E. coli repli- 
case. There are approximately 400 copies of DNA 
polymerase I per cell, but only 10 to 20 copies of 
Pol m (Kornberg & Baker, 1992; Wu et ah, 1984). 
The low abundance of Pol in and its relatively fee- 
ble activity on gapped DNA templates typically 
used as a general replication assay delayed its dis- 
covery until the availability of mutants defective in 
DNA polymerase I (Kornberg & Gefter, 1972). 

The catalytic subunit of Pol HI is distinguished 
from other polymerases as a component of £. coli 
major replicative complex, apparently not by its 
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intrinsic catalytic activity, but by its ability to inter- 
act with other replication proteins at the fork 
These interactions confer upon the enzyme enor- 
mous processivity. Once the DNA polymerase III 
holoenzyme associates with primed DNA, it does 
not dissociate for over 40 minutes, the time 
required for the synthesis of the entire 4 Mb E. coli 
chromosome (McHenry, 1988). Studies in coupled 
rolling circle models of the replication fork suggest 
the enzyme can synthesize DNA 150 kb or longer 
without dissociation in vitro (Mok & Marians, 1987; 
Wu et ah, 1992). The essential interaction required 
for this high processivity is an interaction between 
the a catalytic subunit and a dimer of p, a sliding 
clamp processivity factor that encircles the DNA 
template like a bracelet, permitting it to rapidly 
slide along with the associated polymerase, but 
preventing it from falling off (LaDuca et ah, 1986; 
Kong et ah, 1992). 

The 02 bracelet cannot spontaneously associate 
with high molecular mass DNA, it requires a mul- 
tiprotein DnaX-complex to open and close it 
around DNA using the energy of ATP hydrolysis 
(Wickner, 1976; Naktinis et al, 1995; Dallmann 
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et al, 1995). In E, coli, the drwX gene encodes two 
proteins, t and y. y is generated by a programmed 
ribosomal frameshifting mechanism five-sevenths 
of the way through dnaX mRNA, placing the ribo- 
some in a —1 reading frame where it immediately 
encounters a stop codon (Flower & McHenry, 
1990; Blinkowa & Walker, 1990; Tsuchihashi & 
Kornberg, 1990). In E. coli, the DnaX-complex has 
the stoichiometry Y 2 T 25i§'iXi*l'i (Dallmann & 
McHenry, 1995). The x protein contains an 
additional carboxyl-terminal domain that interacts 
tightly with the polymerase, holding two poly- 
merases together in one complex that can coordi- 
nately replicate the leading and lagging strand of 
the replication fork simultaneously (McHenry, 
1982, 1988; Studwell-Vaughan & O'Donnell, 1991). 

Pol Ms are apparently conserved throughout 
mesophilic eubacteria. In addition to E. coli and 
related proteobacteria, the enzyme has been puri- 
fied from the firmicute Bacillus subtilis (Low et ah, 
1976; Hammond & Brown, 1992). By inference 
from DNA sequence information available from 
the proliferation of bacterial genomes sequenced, 
Pol III exits in organisms as widely divergent as 
Caulobacter, Mycobacteria, Mycoplasma, B. subtilis 
and Synechocystis. The existence of dnaX and dnaN 
(structural gene for p) is also apparent in these 
organisms. These general replication mechanisms 
are conserved even more broadly in biology. 
Eukaryotes, although they do not contain poly- 
merases homologous to Pol HE, contain special 
polymerases devoted to chromosomal replication, 
(Hike processivity factors (PCNA) and DnaX-like 
ATPases (RFC, Activator I) that assemble proces- 
sivity factors on DNA (Yoder & Burgers, 1991; 
Brush et al, 1995; Uhlmann et al, 1996). 

In spite of the near ubiquity of Pol IHs and their 
associated factors required to function as a repli- 
case, no enzymes of this class have been found in 
thermophilic eubacteria, especially in the closely 
related Thermus aquaticus and Thermus thermophilus 
that has been carefully studies in several labora- 
tories. This raised a paradox. Is the mechanism of 
replication somehow different in thermophiles? 
Here, we describe studies where we definitively 
establish the existence of a Pol EH holoenzyme-like 



subassembly that contains, minimally, a Pol in a 
subunit and associated t and y subunits. The latter 
is apparently produced by a frameshifting mechan- 
ism similar to that used in E. coli. The technical 
implications of the existence of a highly processive 
replicase in an extreme thermophile are also dis- 
cussed. 



Results 

Detection and partial purification of 

T. thermophilus Pot III and associated proteins 

A survey of 12 monoclonal antibodies directed 
against the 130 kDa a subunit of the E. coli Pol HI 
holoenzyme revealed a subset that reacted with a 
protein of approximately the same size in Western 
blots of T. thermophilus extracts. These antibodies 
were used as a tool to distinguish Pol IH-like poly- 
merases from the characterized T. thermophilus 
polymerase during protein fractionation pro- 
cedures. 

Having established that a roughly 130 kDa pro- 
tein cross-reacted with anti-E. coli a monoclonals, 
we optimized a lysis procedure and ammonium 
sulfate fractionation that provided a. We found 
that use of a T. thermophilus strain carrying an 
S-layer T. thermophilus mutation {slpA) made the 
cells suitably sensitive to lysozyme mat a modifi- 
cation of the standard E. coli gentle lysis procedure 
(Cull & McHenry, 1995) provided a satisfactory 
method to produce Fraction I (Table 1). We opti- 
mized ammonium sulfate precipitations to provide 
a nearly quantitative precipitation of the candidate 
T. thermophilus a subunit (judged by Western blots) 
while removing as much contaminating protein as 
possible (Fr II; Table 1). Fraction II was dissolved 
and subjected to BioRex70 cation exchange chro- 
matography. This method did not result in resol- 
ution of DNA polymerases, but did remove 
significant levels of contaminating protein that 
flowed through the column. A 25-fold increase 
in gap-filling polymerase specific activity was 
achieved (Fr m, Table 1). 

Hydrophobic interaction chromatography was 
used to resolve a novel minor polymerase peak 



Table 1. Purification of T. thermophilus Pol III and associated proteins 





Units 


Protein 


Specific activity 


Fraction 


(xlO 3 ) 


(mg) 


(units/mg) 


I. Lysis 


ND* 


73,600 


ND 


II. Ammonium sulfate 


2200 


6540 


360 


HI. BioRex-70 


6700 


686 


10,000 


IV. ToyaPearl-Ether 


1400 


54 


25,000 


V. Q-Sepharose 


760 c 


4.8 


160,000 b 



■ Fraction I was not assayed due to a non-linear and unrealiable response, presum- 
ably due to nuclease or inhibitory contaminants. 

b Assays were conducted at 37°C. The specific activity is fivefold higher (800,000 
units/mg) if assayed at 60°C 

c The reported yield is normalized to a complete preparation using all of the mate- 
rial from Fr IV. We used only 50% of Fr. IV in the preparation of Fr. V. 
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Figure 1. Separation of the major T. thermophilus DNA 
polymerase from Pol ID. A, Hydrophobic interaction 
chromatography (ToyaPearl-Ether 650 M) of Fr III con- 
ducted and analyzed as described in Methods. B, Frac- 
tions from a second minor peak of activity were 
subjected to the Western blotting procedure as described 
in Methods to reveal a band that comigrated with E. coli 
Pol III a subunit. Note that the intensity of the band by 
Western blots parallels polymerase activity in the second 
peak. 



from the majority of T. thermophilus DNA polymer- 
ase activity. Fr HI was dissolved and precipitated 
onto ToyaPearl-Ether 650 M beads by adjusting the 
ammonium sulfate concentration to 1.8 M. Protein 
was eluted with a decreasing ammonium sulfate 
gradient (Figure 1A). The major T. thermophilus 
DNA polymerase eluted early in the gradient and 
did not react with anti-oc monoclonal antibodies. 
Peak 2, a minor peak comprising 9% of the poly- 
merase activity, bound more tightly to the column 
(Fr IV, Table 1). Fractions subjected to a Western 
blotting procedure revealed that the cross-reactive 
130 kDa band eluted with an intensity parallel to 
the polymerase activity in peak 2 (Figure IB). On 
this basis, we identify the second peak of activity 
as a Pol Hi-like polymerase, distinct from the 
characterized T. thermophilus DNA polymerase. 

To provide a higher level of purification to 
enable direct exairunation of the components, we 



developed an additional chromatographic step. 
Fraction IV was subjected to chromatography on 
Q-Sepharose to yield a single peak of polymerase 
activity and a sixfold increase in specific activity 
(Figure 2A; Fr V, Table 1). An SDS/polyacrylamide 
gel of the peak fractions showed that a 130 kDa 
protein eluted in parallel to the activity profile 
(Figure 2B). A 43 kDa protein eluted later and 
did not parallel activity and is presumably a 
contaminant. Two additional proteins of 63(±6) 
and 50(±5) kDa eluted roughly in parallel with 
the 130 kDa protein, providing candidates for 



A 




Figure 2. Purification of T. thermophilus Pol III by anion 
exchange chromatography on Q-Sepharose. A, Chroma- 
tography was conducted and samples analyzed as 
described in Methods. B, Samples (60 ul) were subjected 
to SDS/polyacrylamide gel electrophoresis (10% gel) as 
described. The identities of the purified T. thermophilus 
subunits are designated in the left margin; the migration 
positions of £. coli DNA polymerase III hoioenzyme 
markers are indicated in the right margin. 
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E. coli 3 LYRRFRPLTFQEVVGQEH 20 

L R++RP TF +WGQEH 

Tth gamma 1 SALYRRFRPLTFQEWGQE 19 

Tth tau 1 SALYRRFRPLTFQEWGQEH 20 

ALYR FRP F++WGQEH 

6 . subtilis 5 ALYRVFRPQRFEDVVGQ.EH 23 

Figure 3. Ammo-terminal sequence of the isolated candi- 
date y and x T. thermophilus dnaX gene products arid 
comparison to the homologous sequences of E. coli and 
B. subtilis. (T. thermophilus abbreviated as Tth). Letters 
are placed in intervening lines where identical matches 
occur. A (+) indicates a similarity in aligned residues. 



polymerase Hi-associated proteins, possible sub- 
units of a T. thermophilus Pol m holoenzyme. 

Identification of T. thermophilus homologs of 
the E. coli Pol III holoenzyme y and x subunits 

The 63 and 50 kDa proteins associated with the 
T. thermophilus Pol HI ot subunit were separated by 
SDS PAGE, blotted to a membrane and subjected 
to ammo-terminal sequencing. The 19-residue 
sequence obtained from the 50 kDa protein was 
identical to the first 19 residues of the sequence 
obtained for the 63 kDa protein, consistent with 
both being the product of the same gene (Figure 3). 
The sequenced regions were 60% identical to E. coli 
DnaX, the only bacterial DnaX protein that has 
been directly characterized on the basis of activity. 
It was 70% identical to B. subtilis EhiaX. This is 
highly suggestive that the 63 and 50 kDa proteins 
are the T. thermophilus y and x subunits of Pol III 
holoenzyme. We also obtained an internal peptide 
sequence from the isolated 63 kDa band. The 
sequence: ARLLPLAQAHFGVEEWLVLEGE did 
not show any detectable homology to known dnaX 
genes. 

Cloning and sequencing of the T. thermophilus 
dnaX gene 

To definitively test our hypothesis that the 63 
and 50 kDa proteins are products of the T. thermo- 
philus dnaX gene, we used a PCR approach to 
obtain a fragment of the gene. The gene fragment 
was used as a probe to obtain the full length of T. 
thermophilus DnaX, enabling its full sequence to be 
determined. From the N-tenriinal sequence, a pep- 
tide was selected, FQEWGQ, that would provide 
a PCR primer with the least degeneracy with all 
possible codons represented (512-fold). To provide 
an internal primer, we exploited a KTLEEP amino 
acid sequence common to E. coli, B. subtilis (Carter 
et ah, 1993; O'Donnell et ah, 1993) and many other 
eubacteria. From this consensus sequence a 17-mer 
primer with 256-fold degeneracy was designed 
that included all possible combinations of codons. 
Use of these primers in PCR reactions yielded a 



product of 388 nucleotides, close to the spacing of 
the corresponding regions in the E. coli dnaX gene 
(393 nucleotides). The PCR product was cloned 
and sequenced. The deduced amino acid sequence 
was 42% identical to the corresponding segment of 
the B. subtilis dnaX gene. 

Since we had a large probe that was identical to 
the gene, we decided to use a directed approach 
toward cloning the full length T. thermophilus dnaX 
rather than screening of a full library. We per- 
formed a Southern blot against digested T. thermo- 
philus DNA and found that the PCR probe 
hybridized to a 7.1 kb Psfl fragment that was large 
enough to contain the entire T. thermophilus dnaX 
gene. Psfl-digested DNA was subjected to electro- 
phoretic separation and the region corresponding 
to the sought 7.1 kb fragment was extracted and 
cloned. The resulting colonies were screened by 
colony hybridization and a candidate selected. It 
was sequenced, resulting in detection of a 2142 bp 
open reading frame (Figure 4A). Within the candi- 
date open reading frame GUG (the 186th codon) 
was identified as the actual initiation codon from 
the sequence of the amino terminus; it was 
immediately followed by the experimentally deter- 
mined arruno-terrrunal sequence reported in 
Figure 3. GUG is frequently used as an initiation 
codon in T. thermophilus. Methionine amino-pepti- 
dases would be expected to cleave off the terminal 
methionine (Sherman et al. f 1985), revealing serine 
as determined experimentally. The translation of 
the deduced sequence of the T. thermophilus x sub- 
unit is shown in Figure 4B. We observe a perfect 
match with the internal peptide sequence obtained 
from sequencing of the 63 kDa candidate y subunit 
between residues 428 and 450, further confinrting 
that we isolated the structural gene for the protein 
associated with T. thermophilus Pol HI. Alignment 
of the resulting sequence with the identified E. coli 
dnaX gene and putative homologous genes from 
other eubacteria reveal a high level of identity in 
the amino- terrninal region of all bacteria, confirm- 
ing the identity of the isolated gene to be a dnaX 
homolog (Figure 4C). The molecular mass of the 
predicted T. thermophilus x subunit is 58 kDa, in 
reasonable agreement with the 63(=fc6) kDa deter- 
mined from SDS PAGE. 

In E. coli, the shorter y product of the dnaX gene 
is produced by a translational frameshifting mech- 
anism at the sequence A AAA AAG containing 
two adjacent lysine codons read by the Lys UUU 
anticodon tRNA. A potential frameshift site that 
exploits adjacent AAA Lys codons flanked by A 
residues(A AAA AAA A) is observed at codons 
451 and 452 (Figure 4A). Frameshifting into the —1 
reading frame at this site would result in a 51 kDa 
protein, close to the 50 kDa candidate observed in 
Fr. V (Figure 4D). If, instead, a +1 frameshift 
occurs, a 50 kDa protein would be produced, indis- 
tinguishable within experimental error from the —1 
frameshift alternative. 
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A 

TCCGCCAGGCGCTTTCCCGTGAGGTAGAGGGCCATCTCCACCACCTCGAGCTGCTCC 
GGGGGCGTCACCCGAAGCCCCCCCACGAAGCGGCTCTCGTACCCCAGGCGCCCAAGC 
CAAGCCCCGATCTCGGGCCCGCCCCCGTGGACGAGAACCAAGGGACCAGGGTAGGC 
GGCAAGCTCGTCCAATAGGC^C7CCGCCCC^^ 

GGGCCTCA CTCAA GG TA GA CCA CCTCCTCGGGCA GGCCCTCCTCCTCCVCGGCCTCCA G 

GA GG TCGGCGAA GCCCA GAA GGGCG TGG TCCGGG TGGA GGCCGAA GCGGGCGGCGA G 

GGCCCGGA CCGCG TGCTCGGGCGGGA GGGCCTGGGCCCTCG TCA GGA GGGGGGCGAG 

G TCGGCCGGGGCCTTCA GGGCCG TCCCAAGCTCCGGA CCCGCGCGAAGCTCCGCCCTC 

A CCTCCCCCCCCTCGGCCAGGA GGAA GA GGA GG TCCTCCTCGCTTAGTA CCAGGAA GGC 

CAA GGCCTGCCCTA GGCGGGAAA GCTCCCGCA GGAA GGCCCGGA TGGCCTCCGGGTCC 

TCCTCCGCCTCGA GGGCCTCGTGGTA GA GGCTCG TCCAA GGGGGG TCGGGGA CCA GG T 

AGACCCCCGTCCCTCCCGmCGCCCCmGGCCCAGGCCGCCACCGCCTCCAGGGGGGC 

CTGCACGTGCAA GGAGAGGAAGCTCCGCACCACGCCCTA TACTA GCCC77dfiAGCGCC 

CTCTACCGCCGCTTCCGCCCCCTCACCTTCCAGGAGGTGGTGGGGCAGGAGCACGTG 

AAGGAGCCCCTCCTCAAGGCCATCCGGGAGGGGAGGCTCGCCCAGGCCTACCTCTTC 

TCCGGGCCCAGGGGCGTGGGCAAGACCACCACGGCGAGGCTCCTCGCCATGGCGGT 

GGGGTGCCAGGGGGAAGACCCCCCTTGCGGGGTCTGCCCCCACTGCCAGGCGGTGCA 

GAGGGGCGCCCACCCGGACGTGGTGGAGATTGACGCCGCCAGCAACAACTCCGTGG 

AGGACGTGCGGGAGCTGAGGGAAAGGATCCACCTCGCCCCCCTTTCTGCCCCCAGGA 

AGGTCTTCATCCTGGACGAGGCCCACATGCTCTCCAAAAGCGCCTTCAACGCCCTCCT 

CAAGACCCTGGAGGAGCCCCCGCCCCACGTCCTCTTCGTCTTCGCCACCACCGAGCC 

CGAGAGGATGCCCCCCACCATCCTCTCCCGCACCCAGCACTTCCGCTTCCGCCGCCTC 

ACGGAGGAGGAGATCGCCTTTAAGCTCCGGCGCATCCTGGAGGCCGTGGGGCGGGA 

GGCGGAGGAGGAGGCCCrCCTCCTCCTCGCCCGCCTGGCGGACGGGGCCCTTAGGGA 

CGCGGAAAGCCTCCTGGAGCGCTTCCTCCTCCTGGAAGGCCCCCTCACCCGGAAGGA 

GGTGGAGCGCGCCCTAGGCCTCCCCCCCAGGGAGGCCCTGGCCGAGATCGCCGCCTC 

CCTCGCGAGGGGGAAAACGGCGGAGGCCCTGGGCCTCGCCCGGCGCCTCTACGGGG 

AAGGGTACGCCCCGAGGAGCCTGGTCTCGGGCCTTTTGGAGGTGTTCCGGGAAGGCC 

TCTACGCCGCCTTCGGCCTCGCGGGAACCCCCCTTCCCGCCCCGCCCCAGGCCCTGAT 

CGCCGCCATGACCGCCCTGGACGAGGCCATGGAGCGCCTCGCCCGCCGCTCCGACGC 

CTTAAGCCTGGAGGTGGCCCTCCTGGAGGCGGGAAGGGCCCTGGCCGCCGAGGCCCT 

GCCCCAGCCCACGGGCGCTCCCCCCCCAGAGGTCGGCCCCAAGCCGGAAAGCCCCCC 

GGCCCCGGAACCCCCAAGGCCCGAGGAGGCGCCCGACCTGCGGGAGCGGTGGCGGG 

CCTTCCTCGAGGCCCTCAGGCCCACCCTACGGGCCTTCGTGCGGGAGGCCCGCCCGG 

AGGTCCGGGAAGGCCAGCTCTGCCTCGCTTTCCCCGAGGACAAGGCCTTCCACTACC 

GCAAGGCCTCGGAACAGAAGGCGAGGCTCCTCCCCCTGGCCCAGGCCCATTTCGGGG 

TGGAGGAGGTCGTTCTCGTCCTGGAGGGAGAAAAAAAAAGCCTGAGCCCAAGGCCC 

CGCTCGGCCCCACCTCCTGAAGCGCCCGCACCCCCGGGCCCTCCCGAGGAGGAGGTA 

GAGGCGGAGGAAGCGGCGGAGGAGGCCCCGGAGGAGGCCTTGAGGCGGGTGGTCC 

GCCTCCTGGGGGGGCGGGTGCTCTGGGTGCGGCGGCCCAGGACCCGGGAGGCGCCG 

GAGGAGGAACCCCTGAGCCAAGACGAGATAGGGGGTACTGGTATAIAATGGGGGCA 

TGACGCGGACCACCGACCTCGGACAAGAGACCGTGGACAACATCCTCAAGCGCCTCC 

GCCGTATTGAGGGCCAGGTGCGGGGGCTCCAAAAGATGGTGGCCGAGGGCCGCCCC 

TGCGACGAGGTCCTCACCCAGATGACCGCCACCAAGAAGGCCATGGAGGCGGCGGC 

CACCCTGATCCTCCACGAGTTCCTGAACGTCTGCGCCGCCGAGGTCTCCGAGGGCAA 

GGTGAACCCCAAGAAGCCCGAGGAGATCGCCACCATGCTGAAGAAGTTCATCTAGAT 

GGGTCGGCTTCGGGGGCGCCTCCGGCGCCTCCTCCGGGCCCTTCTCGCCCAGGAGGC 



B 

SALYRRFRPLTFOEVVGOEHVKEPLLKj^REGRLAOAYLFSGPRGVGKTTTARLLAMA 
VGCQGEDPPCGVCPHCQAVQRGAHPDVVEIDAASNNSVEDVREIJIERIHLAPLSAPRKV 
FILDEAHML S K S AFN ALLKTXEEPP PHVLFVF ATTEP ERMPPTIL S RTQHFRFRRLTEEE1 A 
FKLRRilJiAVGREAEEEAIXLLARJ^AIXyAERDAE 

EALAEIAASLARGKTAEALGLARRLYGEGYAPRSLVSGLLEVFREGLYAAFGLAGTPLPA 
PPQALIAAMTALDEAMERLARRSDALSLEVALLEAGRALAAEALPQPTGAPPPEVGPKPE 
S PP APEP PRPEEAPDLRERWRAFLEALRPTLRAFVRE ARPEVREGQLC L AFPEDKAFHYR 
KASEQK ARLLFl^OAHFGVEEVVLVLEGE KKSLSPRPRSAPPPEAPAPPGPPEEEVEAE 
EAAEEAPEEALRRVVRLLGGRVLWVRRPRTREAPEEEPLSQDE1GGTGI 



Figure 4(A-B) {legend opposite) 
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GAAAAAAAAAGCCTGAGCCCAAGGCCCCGCTCGGCCCCACCTCCTGA 
EKKSLSPRPRSAPPP 
-1 EKKKPE PKAPLGPTS 
+ 1 E K K A 

Figure 4. Sequence and analysis of the full length T. thermophilus dnaX gene. A, DNA sequence of T. thermophilus 
dnaX and flanking sequences. Note that the initiating GTG was the 186th codon in a long open reading frame and 
could only be conclusively identified by use of the ammo-terminal sequence of protein purified from T. thermophilus. 
The preceding sequences in the open reading frame are italicized. Also in bold and underlined is the AAAAAA 
sequence that, by analogy to E. coli, is probably the frameshifting site that permits synthesis of the shorter y product. 
The potential stop codons for y in Hie +1 (first) and —1 reading frame are dotted /underlined. The stop codon for the 
full length x translation product is underlined. B, The translation of the x gene product of T. thermophilus dnaX. The 
protein sequences directly determined from the isolated T. thermophilus DnaX-proteins are underlined. C, Comparison 
of T. thermophilus dnaX with homologous eubacterial dnaX sequences. To provide the most useful and concise presen- 
tation, only examples from widely divergent organisms are presented. Organisms listed from the top are: Tth (T. ther- 
mophilus; Chloroflexaceae/Deinococcaceae group), E.coli (proteobacteria group, gamma division), B.sub. (B. subtilis, 
firmicute group, low G + C Gram-positive bacteria division), Mycopl {Mycoplasma pneumoniae, firmicute group, 
mycoplasma division), Caulo (Caulobacter crescentus, proteobacteria group, alpha division), $yn.sp. (Synechocystis sp., 
Cyanobacteria group, (blue-green algae)). Sequences shaded in black are identical among the indicated bacteria; 
sequences shaded in gray are similar. The alignment was not shown beyond T. thermophilus DnaX residue 240 
because it was insignificant. D, Deduced sequence of T. thermophilus y subunit depending on whether the frameshift 
is —1 (as in E. coli) or -hi as would be permitted from the sequence as described in Discussion. The sequence preced- 
ing the sequence shown should be identical for t and y. 
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Discussion 

Evidence from biochemistry and genomics 
suggests that the mechanism of DNA replication is 
uniform throughout biology. Yet, studies in ther- 
mophilic eubacteria have not led to the discovery 
of Pol III holoenzyme-like replicases. Armed with 
a bank of specific monoclonal antibodies against 
the a subunit of Pol III and our present knowledge 
of the properties of Pol HI complexes, we took a di- 
rected approach in a search for Pol HI holoenzyme- 
like complexes in T. thermophilus. 

The initial cation exchange step was used 
because polymerases, in general, tend to interact 
strongly with cation exchangers, presumably 
through active site nucleic acid binding residues 
since the overall proteins are acidic. This resulted 
in isolation of an enriched polymerase fraction, but 
did not resolve the Pol DT antibody reactive frac- 
tion from the majority of polymerase activity. £. coli 
Pol HI holoenzyme binds tightly to hydrophobic 
columns, affording an effective purification step 
(McHenry & Kornberg, 1977). Thus, we attempted 
hydrophobic interaction chromatography in the 
hope of resolving T. thermophilus Pol HI from the 
smaller and, presumably, more hydrophilic DNA 
polymerase I-like activity. The chromatographic 
profile, based on gap-filling polymerase activity, 
revealed a major loosely bound peak and a more 
tightly bound minor peak. The second tightly 
bound polymerase cross-reacted specifically with 
E. coli anti-a monoclonals; the first peak did not. 
An additional chromatographic step on the 
resolved polymerase afforded material where the 
candidate a and two apparently comigrating com- 
ponents of 63 and 50 kDa comprised approxi- 
mately 50% of the protein resolved on an SDS gel. 
The 63 kDa protein chromatographed in parallel 
with the polymerase. The 50 kDa protein was more 
strongly represented toward the early portion of 
the peak. 

The associated 63 and 50 kDa proteins were sub- 
jected to airrino-terrriinal sequencing, revealing 
identical termini, consistent with their being the 
product of the same gene. The 20-residue terminal 
sequence was 60 and 70% identical to amino-term- 
inal sequences of the E. coli and B. subtilis DnaX 
proteins, respectively. The association of a T. ther- 
mophilus DnaX homolog with a protein that cross- 
reacts with anti-pol III a monoclonal antibodies 
and is the same molecular mass as the E. coli Pol 
HI a subunit in a 50% pure protein preparation, we 
argue, establishes the existence of a Pol IIT holoen- 
zyme-like complex in T. thermophilus. Although we 
do not have a holoenzyme specific assay that 
enables rigorous quantitation of the extent of puri- 
fication, Pol HI of E. coli needed to be purified 
30,000-fold (McHenry & Crow, 1979). If the T. ther- 
mophilus polymerase ni has been purified 15,000- 
fold, the chances of a dnoX homolog co-purifying 
by chance is exceedingly remote. Since we do not 
yet possess the T. thermophilus p subunit, we can- 



not determine whether the isolated complex is 
active in replication complex assembly. 

To confirm the identity of the associated DnaX- 
like components, we isolated their structural gene 
by a reverse genetic approach. We isolate a PCR 
fragment of the gene using the ammo-terminal 
sequence experimentaUy obtained and an internal 
peptide sequence that is conserved among eubac- 
terial DnaX proteins. We obtained a PCR frag- 
ment of the predicted size that was highly 
homologous to dnaX genes in the fragment 
between the regions corresponding to the PCR 
primers, elirninating the bias imposed by primer 
selection. The PCR-genera ted probe led to the iso- 
lation of a full length gene (Figure 4A) that is 
highly homologous to eubacterial dnaX genes. 
Every region conserved among other eubacterial 
dnaX genes is found represented in T. thermophi- 
lus dnaX. These sequences include the Walker- 
type ATP binding site, represented by GVGKTTT 
in T. thermophilus dnaX, and highly conserved 
EIDAAS and FNALLKTLEEP sequences 
(Figure 4C). The conservation beyond the first 
220 residues falls off for all dnaX genes. Thus, the 
internal peptide sequence obtained that starts at 
residue 427 (second underlined sequence in 
Figure 4B) was useful in confirming that the 
DNA sequence was in the correct reading frame 
and providing further confirmation that we have 
isolated the structural gene for the same proteins 
isolated in association with T. thermophilus Pol m. 

The specific activity of the highly purified T. 
thermophilus Pol m was 800,000 units /mg at 60°C 
The specific activity of pure E. coli Pol IH is 
2.5 x 10 6 at 30°C (Kim & McHenry, 1996). After 
correction for the presence of contarninants and the 
associated x and y proteins, the specific activity of 
T. thermophilus Pol ni core is ca 2 x 10 6 , close to 
the activity of its E. coli counterpart when com- 
pared at temperatures slightly below their optimal 
growth temperature. 

Our studies also provide information regarding 
a conservation of a frameshifting mechanism to 
generate related ATPases that, by analogy to 
E. coli, can both assemble a 0 processivity factor 
onto primed DNA. Both a 63 kDa x subunit that 
has a molecular mass consistent with its being a 
full length dnaX translation product and a 50 kDa 
y subunit that likely arises by translational frame- 
shifting were detected in enzyme purified from 
T. thermophilus extracts. Examination of the dnaX 
DNA sequence is consistent with this hypothesis. 
In E. coli, ribosomes frameshift at the sequence A 
AAA AAG into a —1 frame where the lysine 
UUU anticodon tRNA can base-pair with six As 
before elongating (Flower & McHenry, 1990; 
Blinkowa & Walker, 1990; Tsuchihashi & 
Kornberg, 1990). In T. thermophilus, the putative 
frameshift site has the sequence A AAA AAA A, 
which would enable either a +1 or —1 frameshift. 
The +1 frameshift product would extend only 
one residue beyond the Lys-Lys encoding 
sequence where the frameshift occurs, similar to 
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the E. coli — 1 product (Figure 4D). A —1 frame- 
shift would encode a protein with a 12-amino 
acid extension. Such an extension could permit 
an interaction that may further distinguish y 
from x functionally or could loop back to stabil- 
ize its structure in a thermal environment. Resol- 
ution of this issue will require further 
investigation since the two alternatives differ by 
ca 1 kDa, less than the resolution of SDS PAGE. 

To date only one DNA polymerase has been dis- 
covered in thermophilic eubacteria, even though 
others have been actively sought (Chien et al, 
1976, Kaledin et al, 1980; Lawyer et al, 1989). Our 
studies establish that one extreme thermophile has 
a Pol Ill-class polymerase and an associated Pol III 
holoenzyme auxiliary subunit homolog. This pro- 
vides strong evidence for the existence of the stan- 
dard Pol IE holoenzyme-replication mechanism in 
this one thermophile and establishes a principle 
that could be extended to other thermophiles. 
Since T. thermophilus (and T. aquaticus) DNA poly- 
merase is homologous to DNA polymerase I 
(Lawyer et al, 1989) and since we have established 
the existence of an additional polymerase in this 
organism, we propose the enzyme be referred to as 
T. thermophilus DNA polymerase I in the future to 
distinguish the two enzymes. 

E. coli Pol HI holoenzyme can remain associated 
with primed DNA for 40 minutes and replicates 
DNA at 500 to 1000 nucleotides/second (McHenry, 
1988). Existing PCR technology is limited by rela- 
tively non-processive repair-like DNA poly- 
merases. Our studies reveal the existence of a 
thermophilic replicase that, by inference, likely pro- 
cesses rapid replication and highly processive 
properties at elevated temperatures. It is likely that 
such an enzyme can be developed into a tool for 
megabase PCR, removing the current length 
restrictions and enabling new technological 
advances in molecular biology. 

Methods 
General methods 

Protein determinations were conducted using 
the Coomassie Protein Assay Reagent from Pierce 
and BSA as a standard. DNA manipulations, clon- 
ing, Southern Blots, and standard molecular bio- 
logical methods were carried out as described 
(Ausubel et al., 1995). Conductivity measurements 
were performed using a Radiometer CDM83 con- 
ductivity meter on samples diluted 1:100 in dis- 
tilled water equilibrated to room temperature. 
Standards were run in parallel. 

Identification of monoclonal antibodies that 
cross-react with T. thermophilus a subunit of 
Pol III 

In general, large asymmetric complexes are 
relatively insoluble in ammonium sulfate. 
Ammonium sulfate fractionation provides a 50- 



fold purification of the E. coli Pol III holoen- 
zyme. We took a low ammonium sulfate cut of 
T. thermophilus extracts to provide a source of 
protein enriched sufficiently in Pol III that we 
could use an ELISA assay to screen 12 mono- 
clonal antibodies directed against the £. coli Pol 
m a subunit (M. Olson & C. S. M, unpub- 
lished) to see if they cross-reacted with a T. 
thermophilus protein. The ammonium sulfate 
fraction was prepared by addition of 0.246 g 
ammonium sulfate to each ml of Fr I using the 
same approach as described for the preparation 
of Fr II in the Pol III preparation described 
below. For the ELISA screening assay, all 
manipulations were conducted at room tempera- 
ture. Into each well of a 96-well micro titer plate 
(Corning Costar High Binding EIA/RIA) was 
placed 4ug protein in 150 nl Buffer E7 (10 mM 
Tris-HCl (pH 7.5), 150 mM NaCl, 0.05% Tween- 
20). After an overnight incubation, each well 
was blocked by incubation with buffer 
E7 + 10 jig/ml BSA for three hours, followed by 
three washes with buffer E7 + 10 mg/ml BSA 
and the addition of 150 ul of hybridoma super- 
natant and a 3.5 hours incubation. Wells were 
then washed three times with buffer E7 and 
once with an equivalent buffer that had been 
adjusted to pH 8.8 (buffer E8). An amount 
(150 ul) of a 1:3000 dilution of goat anti-mouse 
IgG antibody-alkaline phosphatase conjugate 
(BioRad) was added and incubated for one 
hour. Wells were washed three times with buf- 
fer E7, once with buffer E8 and developed in 
the presence of p-nitrophenyl phosphate for ten 
minutes as described (Harlow & Lane, 1988). 
Absorbance was read at 405 ran Monoclonal 
antibodies produced from hybridoma lines 
C1950-F3 and C1104-H2 gave an absorbance of 
0.17 and 0.16, respectively; all other candidate 
monoclonal supernatants gave an absorbance of 
0.06 on average. We combined equal volumes 
of both supernatants for future work. An 
amount (40 ug) of the 0.246 ammonium sulfate 
fraction of T. thermophilus Fr I was subjected to 
the Western blotting procedure described below. 
A band migrating with the same mobility as 
the a subunit of Pol EH of E. coli was detected. 
Control experiments using several different 
monoclonal antibodies did not result in detec- 
tion of the same band. 

Gap filling polymerase assay 

The gap filling assay used, is the same as that 
used to monitor the purification of E. coli Pol III 
core (Kim & McHenry, 1996) except that the assay 
mixture contained 32 mM Hepes (pH 7.5), 13% gly- 
cerol, 0.01% Nonidet P40, 0.13 mg/ml BSA, 10 mM 
MgCl^ 0.2 mg/ml activated calf-thymus DNA, 
57 nM each of dGTP, dATP, dCTP and 21 uM 
[ 3 H]dTTP (360 cpm/pmol dTTP). The reaction was 
started by the addition of polymerase in a 15 jil 
reaction mixture and incubated at 37°C for five to 
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ten minutes. Acid insoluble DNA product was 
recovered and quantified as described (Kim & 
McHenry, 1996). The assay was performed at 37°C, 
in part for convenience / but also because the 
decrease in activity for Pol HI at lower tempera- 
tures is less than that of the major T. thermophilus 
DNA polymerase activity, permitting more effi- 
cient detection of Pol III because it comprised a 
greater percentage of total polymerase activity 
when assayed this way. One unit of activity is 
equivalent to 1 pmol total nucleotide synthesis per 
minute at 37°C. The ratio of activity for Fr V T. 
thermophilus Pol III between 37°C and 60°C is five- 
fold. 

Western blotting procedures 

Aliquots of column fractions (50 ul) were sub- 
jected to electrophoresis on a 10% SDS/polyacryl- 
amide gel. Proteins were then transferred from 
the gel to a PVDF membrane (BioRad). Transfer 
buffer contained 25 mM Tris, 192 mM glycine, 
20% methanol and was adjusted to pH 8.5 by the 
addition of HCL Transfer was conducted for two 
hours at 70 V (0.7 A). The membrane was then 
washed in 10 mM Tris-HCl (pH 7.5), 150 mM 
NaCl, 0.05% Tween-20 followed by incubation for 
one hour in the same buffer with 5% dried milk. 
The blot was then incubated overnight with 
hybridoma supernatant, washed (three times) in 
100 ml buffer TBS containing 0.5% Tween-20, 
incubated for one hour with a 1:2000 dilution of 
goat anti-mouse IgG alkaline phosphatase conju- 
gate (BioRad) in buffer TBS containing 0.5% 
dried milk, washed (three times) in 100 ml buffer 
TBS + 0.5% Tween-20 and once in 100 ml buffer 
P (100 mM Tris-HCl (pH 9.5), 100 mM NaCl, 
5 mM MgCl 2 ). The membrane was developed by 
incubating in a 1:150 dilution of a GIBCO nitro- 
blue tetrazolium stock solution and a 1 :300 
dilution of a 5-bromo-4-chloro-3-indoyl phosphate 
stock solution in buffer B until the intensity of 
the a band reached the desired intensity. The 
development of color was stopped by washing 
the membrane in water. 

Partial purification of T. thermophilus Pol III a 
subunit and associated proteins. 

T. thermophilus strain pMF48KAT (Lasa et ah, 
1992) was grown in 180 liter batches with aeration 
in a 250 liter fermentor at 72° C in a medium con- 
taining (per liter): 0.27 g ferric chloride hexahy- 
drate, 0.294 g sodium citrate trisodium salt 
dihydrate, 0.025 g calcium sulfate dihydrate, 0.20 g 
magnesium chloride hexahydrate, 0.53 g am- 
monium chloride, 8 g pancreatic digest of casein 
(DMV International), 4g yeast extract (Ardamine 
Z, Champlain Industries), 4.0 g glucose (Cerelose 
2001, food grade (Corn Products, International), 
0.5 g L-glutamic acid monosodium salt, 0.254 g 
sodium phosphate monobasic (dissolved in 5.6 ml 
water), 1.5 g dipotassium phosphate (dissolved in 



11.1 ml water). The medium was maintained at pH 
7.3 by the addition of 3 M NaOH. An additional 
five liter solution of glucose (480 g/1) and L-gluta- 
mic acid (60 g/1) was added incrementally during 
the course of the fermentation run. A 250 ml over- 
night inocculum (grown in the same med- 
ium + 30 ug/ ml kanamycin) was used. Cells were 
harvested by passage through a heat exchanger in 
a Sharpies AS 16 centrifuge and resuspended in an 
equal weight of Tris-sucrose (50 mM Tris-HCl (pH 
7.5), 10% sucrose) frozen in liquid N 2 and stored at 
-20°C until used. 

4.7 kg of a 1:1 suspension of cells in Tris-sucrose 
were added to 6.5 liters Tris-sucrose that had been 
prewarmed to 55°C. To the stirred mixture was 
added 117 ml 0.5 M DTT and 590 ml of 2 M NaCl, 
0.3 M spermidine in tris-sucrose adjusted to pH 
7.5. The pH of the slurry was adjusted to pH 8 by 
the addition of 2 M Tris base and 2.35 g lysozyme 
was added. The slurry was distributed into 250 ml 
centrifuge bottles and incubated at 30°C for one 
hour with occasional inversion and then centri- 
fuged at 23,000 g for 60 minutes at 4°C. The recov- 
ered supernatant (eight liters) constituted Fr I 
(Table 1). 

To Fr I was added ammonium sulfate (0.267 g/ 
initial ml Fr I) over a 15 minute interval. The mix- 
ture was stirred for an additional 30 minutes at 
4°C and then centrifuged at 23,000 g for 60 minutes 
at 0°C. The recovered pellet was resupended (on 
ice) in 563 ml (0.07 x Fr I volume), 50 mM Tris- 
HCl (pH 7.5), 20% glycerol, 1 mM EDTA, 0.1 M 
NaCl, 5 mM DTT, 0.18 g (added to each ml of final 
solution) ammonium sulfate and the resulting sus- 
pension centrifuged at 23,000 # for 60 minutes at 
4°C resulting in Fr II (Table 1). 

A BioRex 70 cation exchange chromatography 
step was developed that resolved polymerase ac- 
tivity from the majority of contaminating protein. 
Fr H was resuspended in 100 ml buffer U (50 mM 
Imidazole-HCl (pH 6.8), 20% glycerol, 35 mM 
ammonium sulfate, 1 mM magnesium acetate, 
0.1 mM zinc sulfate, 5 mM p-mercaptoethanol, 
0.1 mM ATP) and dialyzed twice successively ver- 
sus two liters buffer U. Dialysate was applied to 
a 300 ml BioRex 70 (BioRad, 100 to 200 mesh, 
5.5 cm diameter) column equilibrated in buffer U 
and washed with 0.9 liter buffer U. Activity was 
eluted with a 1.5 liter 0 to 300 mM NaCl gradient 
in buffer U. All gradient eluted fractions contain- 
ing greater than 20,000 units of gap-filling poly- 
merase activity/ml were pooled, constituting Fr. 
HI (Table 1). 

Hydrophobic interaction chromatography was 
used to resolve a unique polymerase that cross- 
reacted with antibody directed against E. coli Pol 
III a subunit. Fr HI protein was precipitated by the 
addition of an equal volume of saturated 
ammonium sulfate and the pellet was collected by 
centrifugation at 23,000 g, one hour, 4°C. The pellet 
was dissolved in 40 ml buffer U and 20 ml Toya- 
Pearl-Ether 650 M (Toso Haas) equilibrated in buf- 
fer U + 1.6M ammonium sulfate was added. To 
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the stirred suspension was added (drop wise) 
33.6 ml 4 M ammonium sulfate and the entire mix- 
ture was applied to an 80 ml ToyoPearl-Ether col- 
umn (3.5 cm diameter) equilibrated in buffer 
U + 1.6M ammonium sulfate. The column was 
washed with 80 ml buffer U + 1.1M ammonium 
sulfate and polymerase activity eluted with a 1.5 
liter 1.1 to 0.3 M ammonium sulfate gradient in 
buffer U (Figure 1). Fractions containing polymer- 
ase activity were subjected to the Western blotting 
procedure. The second peak (fractions 45 to 50) 
that contained polymerase activity and reacted 
with monoclonal antibodies against E. coli Pol m a 
subunit were pooled, constituting Fr IV (120 ml, 
Table 1, Figure 1A,B). 

One-half of Fr IV was combined with an equal 
volume of saturated ammonium sulfate (4°C) and 
the resulting precipitate collected by centrifugation 
and redissolved in 4 ml buffer U and dialyzed 
overnight versus 250 ml buffer U (one buffer 
change) and the dialysate was applied to a 5 ml Q- 
Sepharose fast flow column (1.4 cm diameter) equi- 
librated in buffer U. The column was washed with 
15 ml buffer U and polymerase activity was eluted 
with a 75 ml 0 to 275 mM NaCl gradient in buffer 
U. A contaminating protein eluted toward the end 
of the activity peak; thus, fractions were carefully 
selected for pooling based on purity. All fractions 
that contained greater than 50 ug protein and a 
specific activity greater than 104,000 were pooled 
(Fr. 28 to 36) to yield Fr V (25 ml, Table 1, Figure 2). 

Procedures used in obtaining protein sequence 

N-terminal sequencing was performed on the t 
(63kDa) and 7 (50kDa) proteins (Fr. V, 120 ug 
total protein) separated on a 10% SDS polyacryl- 
amide gel and transferred to a Hyperbond (BioR- 
ad) PVDF membrane (Matsudaira, 1987). The 
appropriate Coomassie blue stained band was cut 
out and sequenced on an ABS 477A Protein 
Sequencer according to manufacturer's instruc- 
tions. N-terminal sequencing was performed by Dr 
James McManaman in the UCHSC Cancer Center 
Protein microsequencing facility. 

Internal peptide sequences were determined at 
the Harvard Microchemistry facility directed by Dr 
Wm Lane. Their standard procedures were fol- 
lowed. An aliquot of Fr. V (230 ug) was subjected 
to electrophoresis on a 10% polyacrylamide gel 
and the fractionated proteins transferred to a 
PVDF membrane (BioRad). The Ponseau-S stained 
63 kDa band was cut out and subjected to diges- 
tion by endo Lys-C, the resultant peptide separated 
by HPLC and one peptide was chosen for sequen- 
cing. 

Isolation of T. thermophilus dnaX probe by PCR 

Forward primers were selected from the amino- 
terminal protein sequence that would provide the 
least degeneracy; nevertheless, two separate pri- 
mers had to be synthesized so that no primer was 



more than 512-fold degenerate. Primer XI Fa (5'- 
TTY CAR GAR GTN GTN GGW CA) and Primer 
XlFb (5'-1TY CAR GAR GTN GTN GGS CA) dif- 
fered only in the sixth codon. For the reverse pri- 
mer, Primer X139R was used (5'-GGY TCY TCN 
ARN GTY IT; 256-fold degenerate). For PCR reac- 
tions, the Boehringer Mannheim Expand™ long 
template PCR system was used and we followed 
the manufacturer's recommendations except as 
noted. Reactions were conducted in Boehringer 
Expand™ buffer 1 supplemented with 0.5 mM 
extra Mg 2 " 1 ". The annealing steps were conducted at 
48°C, elongation at 68°C and the melting step at 
94° C; 26 total cycles were run. Products were sep- 
arated on 2% FMC Metaphor agarose gels, visual- 
ized by Sybr Green-I staining, extracted and cloned 
into vector pCRQ (Invitrogen). A plasmid isolate 
containing an EcoRI fragment of the predicted size 
based upon the distance between the homologous 
region of the E. coli dnaX gene (393 bases) was sub- 
mitted for DNA sequencing and was shown to 
encode a 388 bp segment highly homologous to 
other bacterial dnaX genes (42% identical to B. sub- 
tilis dnaX in the segment between the regions corre- 
sponding to the primers). 

Cloning and sequencing of full-length 
T. thermophilus dnaX gene 

A Pstl digest of T. thermophilus chromosomal 
DNA was subjected to electrophoretic separation 
on an agarose gel and the region corresponding to 
a 7.1 kb fragment that hybridized with the partial 
dnaX probe generated by PCR was extracted and 
cloned into vector pMGC707. pMGC707 had been 
prepared by cutting pCRU (Invitrogen) with Spel 
and Notl and inserting a polylinker resulting from 
the annealing oligonucleotides (5'-GGCCGC- 
AATTGCACGCGTTCGAATTCCATGACGTCTTC- 
CAGTGCACTGGTTAATTA) and (S'-CTAGTTA- 
ATTAACCAGTGCACTGGAAGACGTCATGG- 
AATTCGAACGCGTGCAATTG). The polylinker 
region of the resulting plasmid was cleaved with 
BsfXI to generate Pstl compatible terrnini to receive 
the extracted 7.1 kb population of fragments. Colo- 
nies were screened by colony hybridization (Ausu- 
bel et al, 1995) using the T. thermophilus dnaX PCR- 
generated probe. Plasmids from positive colonies 
were purified and those containing 7.1 kb inserts 
and also a BamHI site as indicated from the 
sequence of the PCR probe were retained for 
further characterization. One was submitted for 
full DNA sequencing (Lark Technologies, Houston, 
TX), resulting in the sequence of the full length 
gene (Figure 4). 
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Alignment file 
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i Colours 


Guide tree file 
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Your input file 


dustalw-2005041 0-1 6395846.input 




[ SUBMIT ANOTHER JOB ] 



To save a result file right-dick the file link in the above table and choose 'Save Target As". 

If you cannot see the JalView button, reload the page and check your browser settings to enable Java A$ 



Scores Table 



[ View Output File ] 



SeqA Name Len(nt) SeqB Name Len(nt) Score 



1 McHenry 2681 2 Yurieva 2007 98 



PLEASE NOTE: Some scores may be missing from the above table if the alignment was done using mul 



[ View Output File ] 



http://vww.ebi.ac. ul^ 1 0- 1 63 ... 4/10/2005 



ClustalW 
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Alignment 

[ Show Colors ] [ View Alignment File ] 

CLUSTAL W (1-82) multiple sequence alignment 



McHenry 
Yurieva 



TCCGCCAGGCGCTTTCCCGTGAGGTAGAGGGCCATCTCCACCACCTCGAGCTC 



McHenry 
Yurieva 



GGCGTCACCCGAAGCCCCCCCACGAAGCGGCTCTCGTACCCCAGGCGCCCAAC 



McHenry 
Yurieva 



CCGATCTCGGGCCCGCCCCCGTGGACGAGAACCAAGGGACCAGGGTAGGCGG( 



McHenry 
Yurieva 



TCCAATAGGGCCTCCGCCCCCCTAAGGCTTCCTCCCACCTTCACCAAAAGGGC 



McHenry 
Yurieva 



AAGGTAGACCACCTCCTCGGGCAGGCCCTCCTCCTCCTCGGCCTCCAGGAGG'. 



McHenry 
Yurieva 



GCCCAGAAGGGCGTGGTCCGGGTGGAGGCCGAAGCGGGCGGCGAGGGCCCGGi 



McHenry 
Yurieva 



CTCGGGCGGGAGGGCCTGGGCCCTCGTCAGGAGGGGGGCGAGGTCGGCCGGG( 



McHenry 
Yurieva 



GGCCGTCCCAAGCTCCGGACCCGCGCGAAGCTCCGCCCTCACCTCCCCCCCC. 



McHenry 
Yurieva 



GAGGAAGAGGAGGTCCTCCTCGCTTAGTACCAGGAAGGCCAAGGCCTGCCCTi 



McHenry 
Yurieva 



AAGCTCCCGCAGGAAGGCCCGGATGGCCTCCGGGTCCTCCTCCGCCTCGAGG< 



McHenry 
Yurieva 



GTAGAGGCTCGTCCAAGGGGGGTCGGGGACCAGGTAGACCCCOGTCCCTCCC( 

TCCGGGGGTGG — GGTTCCCAGGTAGACGCCGGCCCCTCCC( 

*** *** ** ** ************* ★ ******** 



McHenry 
Yurieva 



CTCGGCCCAGGCCGCCACCGCCTCCAGGGGGGCCTGCACGTGCAAGGAGAGGi 

CTTTACCCAGGCCGCCACCTCCTCCAGGGGGGCCAAGGOGTGCAAGGAGAGG/ 
** ************** ************** *************** 



McHenry 
Yurieva 



caccacgccctatactagcccttgtgagcgccctctac<xk:c^ttccgccc( 
c^ccacgccctatactax3cc-ttgtgagcgcxx:tctagcgocgcttccgccc( 

******************** *******************************t 



McHenry 
Yurieva 



tccaggaggtggtggggcaggagcacgtgaaggagcccctcctcaaggccatc 
tccaggaggtggtggggcaggagcacgtgaaggagcccctcctcaaggccat< 

********************************************** *.* ****-. 



McHenry 



GGAGGCTCGCCCAGGCCTACCTCTTCTCCGGGC<Xy«3GGGCGTGGGCAAGAC( 
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Yurieva 



GGAGGCTCGCCCAGGCCTACCTCTTCTCCGGGCCCAGGGGCGTGGGCAAGAC( 
****************************************************^ 



McHenry 
Yurieva 



CGAGGCTCCTCGCCATGGCGGTGGGGTGCCAGGGGGAAGACCCCCCTTGCGG< 

CGAGGCTCCTCGCCATGGCGGTGGGGTGCCAGGGGGAAGACCCCCCTTGCGG( 
****************************************************: 



McHenry 
Yurieva 



CCCACTGCCAGGCGGTGCAGAGGGGCGCCCACCCGGACGTGGTGGAGATTGAC 
CCCACTGCCAGGCGGTGCAGAGGGGCGCCCACCCGGACGTGGTGGACATTGAC 

********************************************** *****: 



McHenry 
Yurieva 



GCAACAACTCCGTGGAGGACGTGCGGGAGCTGAGGGAAAGGATCCACCTCGC< 

GCAACAACTCCGTGGAGGACGTGCGGGAGCTGAGGGAAAGGATCCACCTCGC( 
***************************************************** 



McHenry 
Yurieva 



CTGCCCCCAGGAAGGTCTTCATCCTGGACGAGGCCCACATGCTCTCCAAAAG( 
CTGCCCCCAGGAAGGTCTTCATCCTGGACGAGGCCCACATGCTCTCCAAAAG( 
****************************************************^ 



McHenry 
Yurieva 



ACGCCCTCCTCAAGACCCTGGAGGAGCCCCCGCCCCACGTCCTCTTCGTCTT( 
ACGCCCTCCTCAAGAGCCTGGAGGAGCCCCCGCCCCACGTCCTCTTCGTCTTC 



McHenry 
Yurieva 



CCGAGCCCGAGAGGATGCCCCCCACCATCCTCTCCCGCACCCAGCACTTCCGC 
CCGAGCCCGAGAGGATGCCCCXICACCATCCTCTCCCGCACCCAGCACTTCCGC 

****************************************************: 



McHenry 
Yurieva 



GCCTCACGGAGGAGGAGATCGCCTTTAAGCTCCGGCGCATCCTGGAGGCCGTC 
GCCTCACGGAGGAGGAGATCGCCTTTAAGCTCCGGCGCATCCTGGAGGCCGT( 



McHenry 
Yurieva 



AGGCGGAGGAGGAGGCCCTCCTCCTCCTCGCCCGCCTGGCGGACGGGGCCCT 
AGGCGGAGGAGGAGGCCCTCCTCCTCCTCGCCCGCCTGGCGGACGGGGCCCT r . 

********* *********************************** ********! ' 



McHenry 
Yurieva 



CGGAAAGCCTCCTGGAGCGCTTCCTCCTCCTGGAAGGCCCCCTCACCCGGAAC 

CGGAAAGCCTCCTGGAGCGCTTCCTCCTCCTGGAAGGCCCCCTCACCCGGAAC 
****************************************************•. 



McHenry 
Yurieva 



AGCGCGCCCTAGGCCTCCCCCCCAGGGAGGCCCTGGCCGAGATCGCCGCCTC( 
AGCGCGCCCTAGGCTCCCCCCCAGGGACCGGGGTGGCCGAGATCGCCGCCTCC 
************** ****** ** * ******************** 



McHenry 
Yurieva 



GGGGGAAAACGGCGGAGGCCCTGGGCCTCGCCCGGCGCCTCTACGGGGAAGGt 
GGGGGAAAACGGCGGAGGCCCTGGGCCTCGCCCGGCGCCTCTACGGGGAAGGC 

****************************************************: 



McHenry 
Yurieva 



CGAGGAGCCTGGTCTCGGGCCTTTTGGAGGTGTTCCGGGAAGGCCTCTACGC< 
CGAGGAGCCTGGTCTOGGGCCTTTTGGAGGTGTTCCGGGAAGGCCTCTACGCC 
*************************************************** 



McHenry 
Yurieva 



GCCTCGCGGGAACCCCCCTTCCCGCCCCGCCCCAGGCCCTGATCGCCGCCAT( 
GCCTCGCGGGAACGCCCCTTCCCGCCCCGCCCCAGGCCCTGATCGCCGCCATC 



McHenry 
Yurieva 



TGGACGAGGCCATGGAGCGCCTCGCCCGCCGCTCCGACGCCTTAAGCCTGGAC 
TGGACGAGGCCATGGAGCGCCTCGCCCGCCGCTCCGACGCCTTAAGCCTGGAC 

******************************************** * * ******: 



McHenry 
Yurieva 



TCCTGGAGGCGGGAAGGGCCCTGGCCGCCGAGGCCCTGCCCCAGCCCACGGGC 
TCCTGGAGGCGGGAAGGGCCCTGGCCGCCGAGGCCCTACCCCAGCCCACGGGC 
************************************* **************- 



McHenry 
Yurieva 



CCCCAGAGGTCGGCCCCAAGCOGGAAAGCCCCCCGGCCCCGGAACCCCCAAG< 
CCCCAGAGGTCGGCCCCAAGGCGGAAAGCCCCCCGACCCCGGAACCCCCAAGC 
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McHenry 
Yurieva 



McHenry 
Yurieva 



McHenry 
Yurieva 



McHenry 
Yurieva 



McHenry 
Yurieva 



McHenry 
Yurieva 



McHenry 
Yurieva 



McHenry 
Yurieva 



McHenry 
Yurieva 



McHenry 
Yurieva 



McHenry 
Yurieva 



McHenry 
Yurieva 



McHenry 
Yurieva 



McHenry 
Yurieva 



AGGCGCCCGACCTGCGGGAGCGGTGGCGGGCCTTCCTCGAGGCCCTCAGGCCf 
AGGCGCCCGACCTGCGGGAGCGGTGGCGGGCCTTCCTCGAGGCCCTCAGGCC( 

****************************************************^ 

GGGCCTTCGTGCGGGAGGCCCGCCCGGAGGTCCGGGAAGGCCAGCTCTGCCT( 
GGGCCTTCGTGCGGGAGGCCCGCCCGGAGGTCCGGGAAGGCCAGCTCTGCCTC 
****************************************************-. 

CCGAGGACAAGGCCTTCCACTACCGCAAGGCCTCGGAACAGAAGGCGAGGCT( 
CCGAGGACAAGGCCTTCCACTACCGCAAGGCCTCGGAACAGAAGGTGAGGCT( 
********************************************* 

TGGCCCAGGCCCATTTCGGGGTGGAGGAGGTCGTTCTCGTCCTGGAGGGAGAi 
TGGCCCAGGCCCATTTCGGGGTGGAGGAGGTCGTCCTCGTCCTGGAGGGAGAi 
********************************** ******************* 

GCCTGAGCCCAAGGCCCCGCTCGGCCCCACCTCCTGAAGCGCCCGCACCCCCC 
GCCTGAGCCCAAGGCCCCGCCCGGCCCCACCTCCTGAAGCGCCCGCACCCCC( 

******************** *******************************: 

CCGAGGAGGAGGTAGAGGCGGAGGAAGCGGCGGAGGAGGCCCCGGAGGAGGC( 
CCGAGGAGGAGGTAGAGGCGGAGGAAGCGGCGGAGGAGGCCCCGGAGGAGGC( 
****************************************************•: 

GGGTGGTCCGCCTCCTGGGGGGGCGGGTGCTCTGGGTGCGGCGGCCCAGGACC 
GGGTGGTCCGCCTCCTGGGGGGGCGGGTGCTCTGGGTGCGGCGGCCCAGGAC( 
*******************************************-*********: 

CGCCGGAGGAGGAACCCCTGAGCCAAGACGAGATAGGGGGTACTGGTATATAi 
CGCCGGAGGAGGAACCCCTGAGCCAAGACGAGATAGGGGGTACTGGTATATAi 
***************************************************** 

ATGACGCGGACCACCGACCTCGGACAAGAGACCGTGGACAACATCCTCAAGC( 
ATGACGCGGACCACCGACCTCGGACAAGAGACCGTGGACAACATCCTCAAGC( 
****************************************************•: 

CGTATTGAGGGCCAGGTGCGGGGGCTCCAAAAGATGGTGGCCGAGGGCCGCO 
CGTATTGAGGGCCAGGTGCGGGGGCTCCAGAAGATGGTGGCCGAGGGCCGCCC 
***************************** **********************^ 

GAGGTCCTCACCCAGATGACCGCCACCAAGAAGGCCATGGAGGGGGGGGCCA( 
GAGGTCCTCACCCAGATGACCGCCACCAAGAAGGCCATGGAGGCGGCGGCCA( 
****************************************************^ 

CTCCACGAGTTCCTGAACGTCTGCGCCGCCGAGGTCTCCGAGGGCAAGGTGAi 
CTCCACGAGTTCCTGAACGTCTGCGCCGCCGAGGTCTCCGAGGGCAAGGTGAi 
****************************************************-. 

AAGCCCGAGGAGATCGCCACCATGCTGAAGAAGTTCATCTAGATGGGTCGGC'. 

AAGCCCGAGGAGATCGCCACCATGCTGAAGAACTTCATCTA 

******************************** ******** 

CGCCTCCGGCGCCTCCTCCGGGCCCTTCTCGCCCAGGAGGC 2681 



PLEASE NOTE: Showing colors on large alignments is slow. 
[ Show Colors ] [ View Alignment File ] 



Guide Tree 
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[ Show as Phylogram Tree ] [ Show Distances ] [ View DND File 

(McHenry: 0.008 97, Yurieva: 0.00897) ; 



Cladogram 



[ Show as Phylogram Tree ] [ Show Distances ) [ View DND File 

Right-dick on the above tree to see display options. 
Problems printing? Read how to print a Phylogram or Cladogram . 
Please contact EBI Support with any problems or suggestions regarding this site. 
View Printer-friendly version of this page | Terms of Use 
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ClustalW Results 



Results of search 


Number of sequences 


o 


Alignment score 


2701 


Sequence format 


Pearson 


Sequence type 


aa 


ClustalW version 


1.82 


JalView 




Output file 


clustalw-2005051 1 -23053240. output 


Alignment file 


clustalw-2005051 1 -23053240.aln 




Guide tree file 


clustalw-2005051 1 -23053240.dnd 


Your input file 


clustalw-2005051 1 -23053240.input 






f SUBMIT ANOTHER JOB ] 



To save a result file right-click the file link in the above table and choose "Save Target As". 

If you cannot see the JalView button, reload the page and check your browser settings to enable Java Applets. 



Scores Table 

[ View Output File ] 

SeqA Name Len(aa) SeqB Name Len(aa) Score 

1 McHenry 528 2 Yurieva 4 97 98 

PLEASE NOTE: Some scores maybe missing from the above table if the alignment was done using multiple CPU mode. Please check the out, 
[ View Output File ] 



Alignment 

[ Show Colors ] [ View Alignment File ] 

CLUSTAL W (1.82) multiple sequence alignment 

McHenry -SALYRRFRPLTFQEWGQEHVKEPLLKAIREGRLAQAYLFSGPRGVGKTTTARLLAMAV 59 

Yurieva MSALYRRFRPLTFQEVG-QEHVKEPLLKAIRE-RLAQAYLFSGPRGVG-TTTARLLAMAV 57 

*************** ************** *************** *********** 
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rage zoi^ 



McHenry 
Yurieva 



McHenry 
Yurieva 



McHenry 
Yurieva 



McHenry 
Yurieva 



McHenry 
Yurieva 



McHenry 
Yurieva 



McHenry 
Yurieva 



McHenry 
Yurieva 



GCQGEDPPCGVCPHCQAVQRGAHPDWEIDAASNNSVEDVRELRERIHLAPLSAPRKVFI 119 
GCQG-DPPCGVCPHCQAVQR-AHPDWDIDAASNNS-EDVRELRERIHLAPL-APRKVFI 113 
**** *************** ******.******** *************** ******* 

LDEAHMLSKSAFNALLKTLEEPPPHVLFVFATTEPERMPPTILSRTQHFRFRRLTEEEIA 179 
LDEAHMLS-SAFNALLKTLEEPPP-VLFVFATTEPERMPP-ILSRTQHFRFRRLT-EEIA 169 
******** *************** *************** ************** **** 

FKLRRILEAVGREAEEEALLLLARLADGALRDAESLLERFLLLEGPLTRKEVERALGLPP 239 
FKLRRILEAVG-RAEEEALLLLARLADG-LRDAESLLERFLLLE-PLTRKEVERALGSPP 22 6 
*********** _*************** *************** ************ ** 

REALAEI AASLARGKTAEALGLARRLYGEGYAPRSLVSGLLEVFREGLYAAFGLAGTPLP 2 99 
-TGVAEIAASLARGKT-EALGLARRLYGEGYA-RSLVSGLLEVFREGL-AAFGLAGTPLP 282 
.********** ** *************** *************** *********** 

A P PQ AL I AAMT ALDE AME RLARRS D AL S LE V ALL E AGRAL AAE AL PQ P TG A P P P E VG P K P 359 
APPQ-LIAAMTALDEAMERLRR-SDALSLEVALLEA-RALAAEALPQPTGAPP-EVGPKP 338 
**** *************** * ************* **************** ****** 

ESPPAPEPPRPEEAPDLRERWRAFLEALRPTLRAFVREARPEVREGQLCLAFPEDKAFHY 419 
ESPPTPEP-RPEEAPDLRERWRAF-EALRPTLRAFVREAR-EVREGQLCLAFPEDK-FHY 394 
******** *************** *************** *************** *** 

RKASEQKARLLPLAQAHFGVEEWLVLEGEKKSLSPRPRSAPPPEAPAPPGPPEEEVEAE 47 9 
RKASEQKVRLLP-AQAHFGVEEWLVLE-EKKSLSPRPRPAPPP-APAPPGPPEEEVEA- 4 50 
*******_**** *************** **********_**** ************** 

EAAEEAPEEALRRWRLLGGRVLWVRRPRTREAPEEEPLSQDEIGGTGI 528 
EAAEEAPEEALRRWR-LGGRVLWVRRPRTRE-PEEEPLSQDEIGGTGI 4 97 
**************** *************** **************** 



PLEASE NOTE: Showing colors on large alignments is slow. 



[ Show Colors ] [ View Alignment File ^) 



Guide Tree 

[ Show as Phylogram Tree ] [ Show Distances ] [ View DND File 

(McHenry: 0.00905, Yurieva: 0.00905) ; 

Cladogram 



McHe 
Yurie\ 



[ Show as Phylogram Tree ] [ Show Distances ] [ View DND File 



Right-click on the above tree to see display options. 

Problems printing? Read how to print a Phylogram or Cladogram . 

Return to site 
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ClustaiW Results 



Results of search 


KlumKor nf com lonroc 


o 


Alignment score 




Sequence format 


Pearson 


Sequence type 


aa 


ClustaiW version 


1.82 


JalView 




Output file 


ciustalw-2005051 1 -23523458. output 


Alignment file 


clustalw-2005051 1 -23523458.aln 


Guide tree file 


ciustalw-2005051 1-23523458.dnd 


Your input file 


clustalw-2005051 1 -23523458.input 






[ SUBMIT ANOTHER JOB ] 



To save a result file right-click the file link in the above table and choose "Save Target As". 

If you cannot see the JalView button, reload the page and check your browser settings to enable Java Applets, 



Scores Table 

[ View Output File ] 

SeqA Name Len(aa) SeqB Name Len(aa) Score 



1 McHenry 454 2 Yurieva 42 6 97 



PLEASE NOTE: Some scores may be missing from the above table if the alignment was done using multiple CPU mode. Please check the out, 
[ View Output File ] 



Alignment 

[ Show Colors ) [ View Alignment File ] 

CLUSTAL W (1.82) multiple sequence alignment 

McHenry -SALYRRFRPLTFQEWGQEHVKEPLLKAIREGRLAQAYLFSGPRGVGKTTTARLLAMAV 5 9 

Yurieva MS AL Y RR FR PLT FQE VG - QE H VKE P LLKA I RE - RL AQ A YL F S G P RG VG - TT T ARL L AMA V 57 

*************** ************** *************** *********** 



i 
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McHenry 
Yurieva 



McHenry 
Yurieva 



McHenry 
Yurieva 



McHenry 
Yurieva 



McHenry 
Yurieva 



McHenry 
Yurieva 



GCQGEDPPCGVCPHCQAVQRGAHPDWEIDAASNNSVEDVRELRERIHLAPLSAPRKVFI 119 

GCQG-DPPCGVCPHCQAVQR-AHPDWDIDAASNNS-EDVRELRERIHLAPL-APRKVFI 113 
**** *************** ******.******** *************** ******* 

LDEAHMLSKSAFNALLKTLEEPPPHVLFVFATTEPERMPPTILSRTQHFRFRRLTEEEIA 179 

LDEAHMLS-SAFNALLKTLEEPPP-VLFVFATTEPERMPP-ILSRTQHFRFRRLT-EEIA 169 
******** *************** *************** ************** **** 

FKLRRILEAVGREAEEEALLLLARLADGALRDAESLLERFLLLEGPLTRKEVERALGLPP 239 

FKLRRILEAVGR-AEEEALLLLARLADG-LRDAESLLERFLLLE-PLTRKEVERALGSPP 22 6 
************ *************** *************** ************ ** 

REALAEIAASLARGKTAEALGLARRLYGEGYAPRSLVSGLLEVFREGLYAAFGLAGTPLP 299 

-TGVAEIAASLARGKT-EALGLARRLYGEGYA-RSLVSGLLEVFREGL-AAFGLAGTPLP 282 
_.************ *************** *************** *********** 

APPQALIAAMTALDEAMERLARRSDALSLEVALLEAGRALAAEALPQPTGAPPPEVGPKP 359 

APPQ-LIAAMTALDEAMERLRR-SDALSLEVALLEA-RALAAEALPQPTGAPP-EVGPKP 338 
**** *************** * ************* **************** ****** 

ESPPAPEPPRPEEAPDLRERWRAFLEALRPTLRAFVREARPEVREGQLCLAFPEDKAFHY 419 

ESPPTPEP-RPEEAPDLRERWRAF-EALRPTLRAFVREAR-EVREGQLCLAFPEDK-FHY 394 
****.*** *************** *************** *************** *** 



McHenry 
Yurieva 



RKASEQKARLLPLAQAHFGVEEWLVLEGEKKSLS 4 54 
RKASEQKVRLLP-AQAHFGVEEVVLVLEEKKKA — 4 26 
******* **** *************** .**. 



PLEASE NOTE: Showing colors on large alignments is slow. 



f Show Colors ] [ View Alignment File ~] 



Guide Tree 

[ Show as Phylogram Tree ] [ Show Distances ] [ View DND File 

(McHenry :0. 01291, Yurieva: 0 .01291) ; 



Cladogram 

— — McHe 

1 Yurie> 

[ Show as Phylogram Tree ] [ Show Distances ] [ View DND File 

Right-click on the above tree to see display options. 

Problems printing? Read how to print a Phylogram or Cladogram . 

Return to site 



http://www.ebi.ac.uk/cg^ 5/11/2005 



rage i oiz 



ClustalW Results 



Results of search 


Number of sequences 


2 


Alignment score 


2343 


wtSlfUcllwC 1 \JI 1 1 lal 


Pparcnn 
i uai owl i 


Sequence type 


aa 


ClustalW version 


1.82 


JalView 




Output file 


clustalw-2005051 1 -2349091 9.output 


Alignment file 


clustalw-2005051 1 -2349091 9.aln 


Guide tree file 


clustalw-2005051 1 -234909 19.dnd 


Your input file 


clustalw-2005051 1 -23490919.input 






[ SUBMIT ANOTHER JOB ] 



To save a result file right-click the file link in the above table and choose "Save Target As". 

If you cannot see the JalView button, reload the page and check your browser settings to enable Java Applets. 

Scores Table 

[ View Output File ] 

SeqA Name Len(aa) SeqB Name Len(aa) Score 

1 McHenry 4 63 2 Yurieva 4 36 97 

PLEASE NOTE: Some scores may be missing from the above table if the alignment was done using multiple CPU mode. Please check the out, 

[ View Output File ] 
Alignment 

[ Show Colors ] [ View Alignment File ] 

CLUSTAL W (1.82) multiple sequence alignment 

McHenry -SALYRRFRPLTFQEWGQEHVKEPLLKAIREGRLAQAYLFSGPRGVGKTTTARLLAMAV 5 9 

Yurieva MSALYRRFRPLTFQEVG-QEHVKEPLLKAIRE-RLAQAYLFSGPRGVG-TTTARLLAMAV 57 

*************** ************** *************** *********** 
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McHenry 
Yurieva 



McHenry 
Yurieva 



McHenry 
Yurieva 



McHenry 
Yurieva 



McHenry 
Yurieva 



McHenry 
Yurieva 



McHenry 
Yurieva 



GCQGEDPPCGVCPHCQAVQRGAHPDWEIDAASNNSVEDVRELRERIHLAPLSAPRKVFI 119 
GCQG-DPPCGVCPHCQAVQR-AHPDWDIDAASNNS-EDVRELRERIHLAPL-APRKVFI 113 
**** *************** ******.******** *************** ******* 

LDEAHMLSKSAFNALLKTLEEPPPHVLFVFATTEPERMPPTILSRTQHFRFRRLTEEEIA 179 
LDEAHMLS-SAFNALLKTLEEPPP-VLFVFATTEPERMPP-ILSRTQHFRFRRLT-EEIA 169 
******** *************** *************** ************** **** 

FKLRRILEAVGREAEEEALLLLARLADGALRDAESLLERFLLLEGPLTRKEVERALGLPP 239 
FKLRRILEAVGR— AEEEALLLLARLADG-LRDAESLLERFLLLE-PLTRKEVERALGSPP 226 
************ *************** *************** ************ ** 

REALAEIAASLARGKTAEALGLARRLYGEGYAPRSLVSGLLEVFREGLYAAFGLAGTPLP 299 
-TGVAEIAASLARGKT-EALGLARRLYGEGYA-RSLVSGLLEVFREGL-AAFGLAGTPLP 282 
.********** ** *************** *************** *********** 

APPQALIAAMTALDEAMERLARRSDALSLEVALLEAGRALAAEALPQPTGAPPPEVGPKP 35 9 
APPQ-LIAAMTALDEAMERLRR-SDALSLEVALLEA-RALAAEALPQPTGAPP-EVGPKP 338 
**** *************** * ************* **************** ****** 

ESPPAPEPPRPEEAPDLRERWRAFLEALRPTLRAFVREARPEVREGQLCLAFPEDKAFHY 419 
ESPPTPEP-RPEEAPDLRERWRAF-EALRPTLRAFVREAR-EVREGQLCLAFPEDK-FHY 394 
****.*** *************** *************** *************** *** 

RKASEQKARLLPLAQAHFGVEEWLVLEGEKKSLEPKAPLGPTS 4 63 
RKASEQKVRLLP-AQAHFGVEEWLVLE-EKKSLEPKAPPGPTS 4 36 
******* **** *************** ********** **** 



PLEASE NOTE: Showing colors on large alignments is slow. 
[ Show Colors ] [ View Alignment File ] 

Guide Tree 



[ Show as Phylogram Tree ] [ Show Distances ] [ View DND File j 

(McHenry: 0.01032, Yurieva: 0.01032) ; 



Cladogram 

j McHe 

I — Yurie> 

[ Show as Phylogram Tree ] [ Show Distances ] [ View DND File ~] 

Right-click on the above tree to see display options. 

Problems printing? Read how to print a Phylogram or Cladogram . 

Return to site 
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This Page is Inserted by IFW Indexing and Scanning 
Operations and is not part of the Official Record 

BEST AVAILABLE IMAGES 

Defective images within this document are accurate representations of the original 
documents submitted by the applicant. 

Defects in the images include but are not limited to the items checked: 

□ BLACK BORDERS 

□ IMAGE CUT OFF AT TOP, BOTTOM OR SIDES 

□ FADED TEXT OR DRAWING 

□ BLURRED OR ILLEGIBLE TEXT OR DRAWING 

□ SKEWED/SLANTED IMAGES 

□ COLOR OR BLACK AND WHITE PHOTOGRAPHS 

□ GRAY SCALE DOCUMENTS 

□ LINES OR MARKS ON ORIGINAL DOCUMENT 

□ REFERENCE(S) OR EXHIBIT(S) SUBMITTED ARE POOR QUALITY 

□ OTHER: 

IMAGES ARE BEST AVAILABLE COPY. 
As rescanning these documents will not correct the image 
problems checked, please do not report these problems to 
the IFW Image Problem Mailbox. 



